A Parsimony Approach to Analysis of Human Segmental Duplications

Authors

  • Crystal L. Kahn
  • Benjamin J. Raphael
Abstract

Segmental duplications are abundant in the human genome, but their evolutionary history is not well understood. The mystery surrounding them is due in part to their complex organization: many segmental duplications are mosaic patterns of smaller repeated segments, or duplicons. A two-step model of duplication has been proposed to explain these mosaic patterns. In this model, duplicons are copied and aggregated into primary duplication blocks that subsequently seed secondary duplications. Here, we formalize the problem of computing a duplication scenario that is consistent with the two-step model. We first describe a dynamic programming algorithm to compute the duplication distance between two strings. We then use this distance as the cost function in an integer linear program to obtain the most parsimonious duplication scenario. We apply our method to derive putative ancestral relationships between segmental duplications in the human genome.
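The abstract does not spell out the dynamic program, so the following is only a simplified illustration of the idea of a copy-counting distance between two strings, not the authors' algorithm. It assumes a restricted model in which the target must be built by concatenating contiguous substrings copied from the source, and it counts the fewest copies needed. The function name `concat_copy_distance` and this restricted model are assumptions made for the sketch.

```python
from typing import Optional


def concat_copy_distance(source: str, target: str) -> Optional[int]:
    """Fewest contiguous substrings of `source` whose concatenation is `target`.

    Simplified stand-in (assumption): copies may only be appended, never
    inserted inside earlier copies. Returns None if `target` cannot be built.
    """
    n = len(target)
    inf = float("inf")
    # best[i] = fewest copies needed to build the prefix target[:i]
    best = [inf] * (n + 1)
    best[0] = 0
    for i in range(n):
        if best[i] == inf:
            continue  # prefix target[:i] is unreachable
        # Try every copy starting at position i. Once target[i:j] is not a
        # substring of source, no longer extension can be one either.
        j = i + 1
        while j <= n and target[i:j] in source:
            best[j] = min(best[j], best[i] + 1)
            j += 1
    return None if best[n] == inf else int(best[n])


if __name__ == "__main__":
    # Each character stands for one duplicon in this toy encoding.
    print(concat_copy_distance("abcde", "abcab"))
```

In this toy encoding, a pairwise cost of this kind could play the role the abstract assigns to the duplication distance, namely an edge cost between candidate ancestor and descendant blocks; the integer linear program itself is not sketched here.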

Similar resources

Parsimony and likelihood reconstruction of human segmental duplications

Motivation: Segmental duplications > 1 kb in length with ≥ 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but importan...

Optimizing Directed Acyclic Graphs via Simulated Annealing for Reconstructing Human Segmental Duplications

Segmental duplications, relatively long and nearly identical regions, prevalent in the mammalian genome, are successfully modeled by directed acyclic graphs. Reconstructing the evolutionary history of these genomic regions is a non-trivial, but important task, as segmental duplications harbor recent primate-specific and human-specific innovations and also mediate copy number variation within th...

Efficient Algorithms for Analyzing Segmental Duplications, Deletions, and Inversions in Genomes

Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics consisting of pieces of multiple other segmental duplications. This complex genomic organization complicates analysis of the evolutionary history of these sequences. Earlier, we introduced a genomic distance, called duplication distance, that computes the mo...

Segmental Duplications as a Complement Strategy to Short Tandem Repeats in the Prenatal Diagnosis of Down Syndrome

Background: Quantitative fluorescence-polymerase chain reaction (QF-PCR) is an inexpensive and accurate method for the prenatal diagnosis of aneuploidies that applies short tandem repeats (STRs) as a chromosome-specific marker. Despite its apparent advantages, QF-PCR is not applicable in all cases due to the presence of uninformative STRs. This study was carried out to investigate the efficienc...

Analysis of segmental duplications and genome assembly in the mouse.

Limited comparative studies suggest that the human genome is particularly enriched for recent segmental duplications. The extent of segmental duplications in other mammalian genomes is unknown and confounded by methodological differences in genome assembly. Here, we present a detailed analysis of recent duplication content within the mouse genome using a whole-genome assembly comparison method ...

Journal:
  • Pacific Symposium on Biocomputing

Publication date: 2009